
(Amended) A [system] method as in Claim Jf^T^ wherein: 




television or radio -broadcast signals; and 



the data acquired from the first data source are 



the data acquired from the second data source are 



computer-readable data files. 



5^ 



(Amended) A method for identifying the boundaries of 
segments in a body of information, each segment comprising a 
contiguous related set of information in the body of information, 
wherein the body of information is represented by a set of text 
data, a set of video data, and a set of audio data^ the method 
comprising the steps of: 

performing a coarse partitioning method, the coarse 
partitioning method further comprising the steps of: 



determining approximate segment boundaries within 
the body of information as the times of occurrence of 
the time-stamp markers; 

for each approximate segment boundary, specifying a 
range of time that includes the time of occurrence of the 
approximate segment boundary; 

extracting subsets of audio data from the set of audio 
data that occur during the specified ranges of time; 

performing a fine partitioning method to identify one 
or more breaks in the set of audio data; 



identifying time-stamped markers in the set of 



text data; and 
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selecting the best break that occurs in each subset of 
audio data, the time of occurrence of the best break in each 
subset being designated as a boundary of a segment in the 
body of informations- 
ascertaining a synchronization of the audio data and 
the video data; and 

determining the location of the segment boundaries in 
the set of video data using the previously determined 
location of the segment boundaries in the set of audio data 
and the synchronization of the audio data and video data. 




Please enter the following new claims: 



P^^. (New) A method as in Claim further comprising the 

step of causing the display of the portion or representation of 
the second segment to occur substantially coextensive in time 
with the display of the related first segment. 

>9. A method as in Claim wherein: 

the step of acquiring data representing the body of 
information further comprises the step of acquiring 
audiovisual data representing at least a portion of the body 
of information, wherein the first and second segments are 
represented by audiovisual data; and 

the step of generating a display of a first segment of 
the body of information further comprises the step of 
generating an audiovisual display of the first segment. 
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A method as in Claim further comprising the step of 




identifying the selection of a second segment for which a portion 
or representation is being displayed, wherein selection of such 
second segment causes an audiovisual display of the selected 
second segment to be produced. 



J?^. A method as in Claim J?5 ^ wherein: 

the step of acquiring data representing the body of 
information further comprises the step of acquiring 
audiovisual data representing at least a portion of the body 
of information; 

the step of generating a display of a first segment of 
the body of information further comprises the step of 
generating an audiovisual display of the first segment; and 
the step of generating a display of a portion of, or a 
representation of, a second segment of the body of 
information further comprises the step of generating a text 
display of the portion or representation of the second 
segment. 



the step of generating a display of a first segment of 
the body of information further comprises the step of 
generating a display of the first segment on an analog 
display device; and 




A method as in Claim wherein: 
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the step of generating a display of a portion of, or a 
representation of, a second segment of the body of 
information further comprises the step of generating a 
display of the portion or representation of the second 
segment on a digital display device. 

A method as in Claim ^S^, wherein: 

the step of generating a display of the first segment 
on an analog display device further comprises the step of 
generating a display of the first segment on a television; 
and 

the step of generating a display of the portion or 
representation of the second segment on a digital display 
device further comprises the step of generating a display of 
the portion or representation of the second segment on a 
computer display monitor. 



method as in Claim >5, further comprising the step of 



identifying the subject matter content of a segment of the body 
of information, wherein the step of comparing further comprises 
the step of determining the similarity of the subject matter 
content of a segment to the subject matter content of a different 
segment, the predetermined criteria including a predefined degree 
of similarity with respect to which the relatedness of the 
compared segments is determined. 
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A method as in Claim 



A method as in Claim 1^ , wherein the step of 
determining the similarity of the subject matter of segments 
further comprises the step of performing a relevance feedback 
method , 



AO 

method as in Claim wherein the step of acquiring 



data further comprises the step of acquiring television broadcast 
signals . 

A method as in Claim ^/^^ wherein the step of acquiring 
data further comprises the step of acquiring radio broadcast 
signals. 

, A method as in Claim wherein the step of acquiring 

data further comprises the step of acquiring computer-readable 
data files over a computer network from an information providing 
site that is part of that network, 

^ , A method as in Claim wherein the step of acquiring 

data further comprises the steps of: 

acquiring television broadcast signals; and 

acquiring computer-readable data files over a computer 

network from an information providing site that is part of 

that network. 
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j8^. a method as in Claim wherein: 

the first segment is represented by data produced from 
the television broadcast signals; and 

the second segment is represented by data from the 
computer-readable data files - 

y&i , A method as in Claim further comprising the step of 

identifying an instruction from a user to begin displaying at 
least some of the body of information, wherein the display of a 
first segment is begun in response to the user instruction. 



3^ 

A method as in Claim >3r. 



A method as in Claim j^, wherein the first and second 
segments are displayed on physically separate display devices. 

A method as in Claim ^3^, wherein the steps of storing 
the acquired data, generating a display of a first segment of the 
body of information, and generating a display of a portion of, or 
a representation of, a second segment of the body of information 
are performed by devices interconnected to a conventional 
computer bus that enables the devices to communicate with each 
other such that the devices do not require wire communication 
over network communication lines to communicate with each other. 

, A method as in Claim wherein at least some of the 

acquired data is digital data, the step of acquiring data further 
comprising the step of acquiring digital data. 
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A method as in Claim wherein at least some of the 
acquired data is analog data^ the step of acquiring data further 
comprising the step of acquiring analog data. 



(ff3 

A computer readable medium as in Claim further 



comprising instructions for causing the display of the portion or 
representation of the second segment to occur substantially 
coextensive in time with the display of the related first 
segment . 

A computer readable medium as in Claim ^ , wherein: 
the instructions for acquiring data representing the 
body of information further comprise instructions for 
acquiring audiovisual data representing at least a portion 
of the body of information, wherein the first and second 
segments are represented by audiovisual data; and 

the instructions for generating a display of a first 
segment of the body of information further comprise 
instruction for generating an audiovisual display of the 
first segment. 

^f€, A computer readable medium as in Claim further 
comprising instructions for identifying the selection of a second 
segment for which a portion or representation is being displayed, 
wherein selection of such second segment causes an audiovisual 
display of the selected second segment to be produced. 



- 8 - 





A computer readable medium as in Claim ^£r9, wherein: 
the instructions for acquiring data representing the 
body of information further comprise instructions for 



of the body of information; 

the instructions for generating a display of a first 
segment of the body of information further comprise 
instructions for generating an audiovisual display of the 
first segment; and 

the instructions for generating a display of a portion 
of, or a representation of, a second segment of the body of 
information further comprise instructions for generating a 
text display of the portion or representation of the second 
segment . 



A computer readable medium as in Claim wherein: 
the instructions for generating a display of a first 
segment of the body of information further comprise 
instructions for generating a display of the first segment 
on an analog display device; and 

the instructions for generating a display of a portion 
of, or a representation of, a second segment of the body of 
information further comprise instructions for generating a 
display of the portion or representation of the second 
segment on a digital display device. 



acquiring audiovisual data representing at least a portion 




h3 
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. A computer readable medium as in Claim wherein: 
the instructions for generating a display of the first 
segment on an analoa display device further comprise 
instructions for generating a display of the first segment 
on a television; and 

the instructions for generating a display of the 
portion or representation of the second segment on a digital 
display device further comprise instructions for generating 
a display of the portion or representation of the second 
segment on a computer display monitor. 



^9^, A computer readable medium as in Claim further 
comprising instructions for identifying the subject matter 
content of a segment of the body of information, wherein the 
instructions for comparing further comprise instructions for 
determining the similarity of the subject matter content of a 
segment to the subject matter content of a different segment, the 
predetermined criteria including a predefined degree of 
similarity with respect to which the relatedness of the compared 
segments is determined. 



instructions for determining the similarity of the subject matter 
of segments further comprise instructions for performing a 
relevance feedback method. 



7» 
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A computer readable medium as in Claim^^, wherein the 



instructions for acquiring data further comprise instructions for 
acquiring television broadcast signals. 

9^. A computer readable medium as in Claim , wherein the 
instructions for acquiring data further comprise instructions for 
acquiring radio broadcast- signals. 



n 



(o5 

A computer readable medium as in Claim wherein the 



instructions for acquiring data further comprise instructions for 
acquiring computer-readable data files over a computer network 
from an information providing site that is part of that network. 

A computer readable medium as in Claim wherein the 

instructions for acquiring data further comprise: 

instructions for acquiring television broadcast 
signals ; and 

instructions for acquiring computer-readable data files 

over a computer network from an information providing site 
that is part of that network. 



A computer readable medium as in Claim wherein; 



the first segment is represented by data produced from 
the television broadcast signals; and 

the second segment is represented by data from the 
computer-readable data files. 
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A computer readable medium as in Claim further 
comprising instructions for identifying an instruction from a 
user to begin displaying at least some of the body of 
information, wherein the display of a first segment is begun in 
response to the user instruction. 

>©tJ. A computer readable medium as in Claim wherein the 

first and second segments are displayed on physically separate 
display devices. 



fo3 

. A computer readable medium as in Claim wherein the 



instructions for storing the acquired data, generating a display^ 
of a first segment of the body of information, and generating a 
display of a portion of, or a representation of, a second segment 
of the body of information are executed by devices interconnected 
to a conventional computer bus that enables the devices to 
communicate with each other such that the devices do not require 
wire communication over network communication lines to 
communicate with each other. 

, A computer readable medium as in Claim 5^, wherein at 
least some of the acquired data is digital data, the instructions 
for acquiring data further comprising instructions for acquiring 
digital data. 
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ju06 . A computer readable medium as in Claim ^5^, wherein at 
least some of the acquired data is analog data, the instructions 
for acquiring data further comprising instructions for acquiring 
analog data. 



J^Q^ . A computer readable medium as in Claim ^,&cr, wherein the 
instructions for determining the degree of similarity further 
comprise instructions for performing a relevance feedback method. 

H 3 

A computer readable medium as in Claim ^<r, wherein the 
instructions for identifying one or more of the previously 
categorized segments as relevant to the uncategorized segment 
further comprise: 

instructions for identifying a plurality of the 
previously categorized segments that are the most similar to 
the uncategorized segments- 
instructions for determining the degree of similarity 
between each of the plurality of previously categorized 
segments and each other of the plurality of previously 
categorized segments; 

instructions for eliminating, for each pair of 
previously categorized segments of the plurality of 
previously categorized segments having greater than a 
predefined degree of similarity, one of the pair of 
previously categorized segments from the. plurality of 
previously categorized segments, wherein the remaining 
previously categorized segment or segments are similar and 



S3 
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distinct previously categorized segments; and 

instructions for identifying one or more of the similar 
and distinct previously categorized segments as relevant 
previously categorized segments. 

XQ^, A computer readable medium as in Claim >er^ wherein the 
instructions for selecting one or more subject matter categories 
further comprise instru-^ tions for selecting the most frequently 
occurring subject matter category or categories associated with 
the relevant previously categorized segments. 

JU©^ . A computer readable medium as in Claim 6^, wherein the 
uncategorized segment has been acquired from a first data source 
and the previously categorized segment or segments have been 
acquired from a second data source that is different than the 
first data source. 

lO^*. A computer readable medium as in Claim >0T, wherein: 
the data acquired from the first data source are 

television or radio broadcast signals; and 

the data acquired from the second data source are 

computer-readable data files. 

J-©^. A system for categorizing according to subject matter 
an uncategorized segment of a body of information that includes a 
plurality of segments, each segment representing a defined set of 
information in the body of information, one or more segments of 
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the body of information having previously been categorized by 
identifying each of the one or more segments with one or more 
subject matter categories, the system comprising: 

means for determining the degree of similarity between 
the subject matter content of the uncategorized segment and 
the subject matter content of each of the previously 
categorized segments ; 

means for identifying one or more of the previously 
categorized segments as relevant to the uncategorized 
segment based upon the determined degrees of similarity of 
subject matter content between the uncategorized segment and 
the previously cat--^ gorized segments; and 

means for selecting one or more subject matter 
categories with which to identify the uncategorized segment 
based upon the subject matter categories used to identify 
the relevant previously categorized segments. 

J>cr, A system as in Claim J^Q^ , wherein the means for 
determining the degree of similarity further comprises means for 
performing a relevance feedback method. 

(03 101 

JUK . A system as in Claim wherein the means for 

identifying one or more of the previously categorized segments as 
relevant to the uncategorized segment further comprises: 

means for identifying a plurality of the previously 

categorized segments that are the most similar to the 

uncategorized segment ; 
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means for determining the degree of similarity between 
each of the plurality of previously categorized segments and 
each other of the plurality of previously categorized 
segments ; 

means for elirinating, for each pair of previously 
categorized segments of the plurality of previously 
categorized segments having greater than a predefined degree 
of similarity, one of the pair of previously categorized 
segments from the plurality of previously categorized 
segments, wherein the remaining previously categorized 
segment or segments are similar and distinct previously 
categorized segments; and 

means for identifying one or more of the similar and 
distinct previously categorized segments as relevant 
previously categorized segments. 

A system as in Claim JU©^, wherein the means for 
selecting one or more subject matter categories further comprises 
means for selecting the most frequently occurring subject matter 
category or categories associated with the relevant previously 
categorized segments . 

JJ*^. A system as in Claim wherein the uncategorized 

segment has been acquired from a first data source and the 
previously categorized segment or segments have been acquired 
from a second data source that is different than the first data 
source . 
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jUW. A system as in Claim J^-tT, wherein: 

the data acquired from the first data source are 
television or radio broadcast signals; and 

the data acquired from the second data source are 
computer-readable data files. 

. A computer readable medium as in Claim ^ , wherein the 
first type of data is audiovisual data and the second type of 
data is text data. 

i^^, A computer readable medium as in Claim JM^, wherein the 
instructions for determining the degree of similarity further 
comprise instructions for performing a relevance feedback method. 

>^7. A computer readable medium as in Claim Jeff, wherein a 
plurality of sets of information, each different from the other 
sets of the plurality of sets of information, are each 
represented by an associated set of data of the second type, the 
one or more computer programs enabling determination of which, if 
any, of the plurality of sets of information represented by a set 
of data of the second type are relevant to the first set of 
information represented by the set of data of the first type, the 
one or more computer programs further comprising: 
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instructions for determining the degree of similarity 
between each set o-P data of the second type representing one 
of the plurality of sets of information and the derived set 
of data of the second type representing the first set of 
information ; 

instructions for identifying which, if any, of the sets 
of data of the second type representing one of the plurality 
of sets of information have greater than a predefined degree 
of similarity to the derived set of data of the second type 
representing the first set of information, the sets of data 
of the second type so identified being termed similar sets 
of data of the second type; 

instructions for determining the degree of similarity 
between each similar set of data of the second type and each 
other similar set of data of the second type; 

instructions for eliminating, for each pair of similar 
sets of data of the second type having greater than a 
predefined degree of similarity, one of the pair of similar 
sets of data of the second type from the set of similar sets 
of data of the second type, wherein the remaining set or 
sets of similar data of the second type are similar and 
distinct sets of data of the second type; and 

instructions for identifying the set or sets of 
information corresponding to one or more of the similar and 
distinct sets of data of the second type as relevant to the 
second set of information. 
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J-toT a computer readable medium as in Claim -W^, wherein 
the instructions for identifying the relevant set or sets of 
information further comprise instructions for identifying no more 
than a predetermined number of relevant sets of information, the 
predetermined number of relevant sets of information 
corresponding to the se+ ~j of data of the second type having the 
greatest degree of similarity to the derived set of data of the 
second type. 

J>l^. A computer readable medium as in Claim JfV, wherein the 
first type of data is analog data and the second type of data is 
digital data, 

A system for determining whether a first set of 
information represented by a set of data of a first type is 
relevant to a second set of information represented by a set of 
data of a second type, the first and second sets of information 
being different from each other, the system comprising: 

means for deriving a set of data of the second type 
from the set of data of the first type, the derived set of 
data of the second type also being representative of the 
first set of information; 

means for determining the degree of similarity between 
the set of data of the second type representing the second 
set of information and the derived set of data of the second 
type representing the first set of information; and 
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means for determining whether the first set of 
information is relevant to the second set of information 
based upon the degree of similarity between the set of data 
of the second type representing the second set of 
information and the derived set of data of the second type 
representing the first set of information, 

. A system as in Claim 12^, wherein the first type of 
data is audiovisual data and the second type of data is text 
data . 

X3r^. A system as in Claim iS^*, wherein the means for 
determining the degree of similarity further comprises means for 
performing a relevance feedback method. 

>a-y. A system as in Claim i3^, wherein a plurality of sets 
of information, each different from the other sets of the 
plurality of sets of information, are each represented by an 
associated set of data of the second type, the system enabling 
determination of which, if any, of the plurality of sets of 
information represented by a set of data of the second type are 
relevant to the first set of information represented by the set 
of data of the first type, the system further comprising: 
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means for determining the degree of similarity between 
each set of data of the second type representing one of the 
plurality of sets f.f information and the derived set of data 
of the second type representing the first set of 
informations- 
means for identifying which, if any, of the sets of 
data of the second type representing one of the plurality of 
sets of information have greater than a predefined degree of 
similarity to the derived set of data of the second type 
representing the first set of information, the sets of data 
of the second type so identified being termed similar sets 
of data of the second types- 
means for determining the degree of similarity between 
each similar set of data of the second type and each other 
similar set of data of the second type; 

means for eliminating, for each pair of similar sets of 
data of the second type having greater than a predefined 
degree of similarity, one of the pair of similar sets of 
data of the second type from the set of similar sets of data 
of the second type, wherein the remaining set or sets of 
similar data of the second type are similar and distinct 
sets of data of the second type; and 

means for identifying the set or sets of information 
corresponding to one or more of the similar and distinct 
sets of data of th^- second type as relevant to the second 
set of information. 
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IS, 



A system as in Claim wherein the means for 



identifying the relevant set or sets of information further 
comprises means for identifying no more than a predetermined 
number of relevant sets of information, the predetermined number 
of relevant sets of information corresponding to the sets of data 
of the second type having the greatest degree of similarity to 
the derived set of data of the second type. 



12^, A system as in Claim ySr&, 



system as in Claim VSr&f wherein the first type of 
data is analog data and the second type of data is digital data, 

J^S^ A computer readable medium as in Claim J^rf, wherein the 
instructions for performing a fine partitioning method further 
comprise instructions for identifying the best breaks using a 
process that includes scene break identification. 

J*T, A computer readable medium as in Claim wherein the 

fine partitioning methoc is performed on the entire set of video 
data to identify all of the breaks in the set of video data. 

12r&. A computer readable medium as in Claim p^, wherein the 
fine partitioning method is performed only on the subsets of 
video data to identify only breaks that occur in the subsets. 

J*^. A computer readable medium as in Claim wherein the 

best break of each subset is determined according to the criteria 
of the fine partitioning method used. 
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J^dfl^. A computer readable medium as in Claim wherein the 

best break of each subset is the break occurring closest in time 
to the time of occurrence of the segment boundary in the text 
data that corresponds to that subset. 

. A computer readable medium as in Claim wherein the 

body of information is represented by a set of text data, a set 
of audio data and a set of video data, the one or more computer 
programs further comprising: 

instructions for ascertaining a synchronization of the 
audio data and the video data; and 

instructions for determining the location of the 
segment boundaries in the set of audio data using the 
previously determined location of the segment boundaries in 
the set of video data and the synchronization of the audio 
data and video data. 

Il3 

A computer readable medium encoded with one or more 
computer programs for identifying the boundaries of segments in a 
body of information, each segment comprising a contiguous related 
set of information in the body of information, wherein the body 
of information is repre-onted by a set of text data, a set of 
video data, and a set of audio data, comprising: 

instructions for performing a coarse partitioning 
method, the instructions for performing a coarse 
partitioning method further comprising: 
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instructions for identifying time-stamped markers 

in the set of text data; and 

instructions for determining approximate segment 

boundaries witiiin the body of information as the times 

of occurrence of the time-stamp markers; 

instructions for specifying, for each approximate 
segment boundary, a range of time that includes the time of 
occurrence of the approximate segment boundary; 

instructions for extracting subsets of audio data from 
the set of audio data that occur during the specified ranges 
of time; 

instructions for performing a fine partitioning method 
to identify one or more breaks in the set of audio data; 

instructions for selecting the best break that occurs 
in each subset of ^;odio data, the time of occurrence of the 
best break in each subset being designated as a boundary of 
a segment in the body of information; 

instructions for ascertaining a synchronization of the 
audio data and the video data; and 

instructions for determining the location of the 
segment boundaries in the set of video data using the 
previously determined location of the segment boundaries in 
the set of audio data and the synchronization of the audio 
data and video data. 
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A computer readable medium as in Claim J-d^T, wherein the 
instructions for performing fine partitioning further comprise 
instructions for identifying the best breaks using a process that 



iJ^. A computer readable medium as in Claim >3-2, wherein the 
instructions for performing fine partitioning further comprise 
instructions for identifying the best breaks using a process that 
includes voice recognition, 

Mb U3 

>3^. A computer readable medium as in Claim wherein the 

instructions for performing fine partitioning further comprise 
instructions for identifying the best breaks using a process that 
includes word recognition. 

13^- A computer readable medium as in Claim 1^3^, wherein the 
instructions for performing fine partitioning further comprise 
instructions for identifying the best breaks using a process that 
includes music recognit.. on . 

A system for identifying the boundaries of segments in 
a body of information, each segment comprising a contiguous 
related set of information in the body of information, wherein 
the body of information is represented by at least a set of text 
data and a set of video data, the system comprising: 



includes pause recognition. 



ll3 
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means for perlorming a coarse partitioning method, the 
means for performing a coarse partitioning method further 
comprising: 

means for identifying time-stamped markers in the 

set of text data; and 

means for determining approximate segment 

boundaries within the body of information as the times 

of occurrence of the time-stamp markers; 

means for specifying, for each approximate segment 
boundary, a range of time that includes the time of 
occurrence of the approximate segment boundary; 

means for ext:. dieting subsets of video data from the set 
of video data that occur during the specified ranges of 
time ; 

means for performing a fine partitioning method to 
identify one or more breaks in the set of video data; and 

means for selecting the best break that occurs in each 
subset of video data, the time of occurrence of the best 
break in each subset being designated as a boundary of a 
segment in the body of information , 



performing a fine parti oning method further comprises means for 
identifying the best breaks using a process that includes scene 
break identification . 




A system as in Claim 




wherein the means for 
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A system as in Claim i^TT, wherein the fine partitioning 
method is performed on the entire set of video data to identify 
all of the breaks in the set of video data. 

JJir&, A system as in Claim wherein the fine partitioning 

method is performed only on the subsets of video data to identify 
only breaks that occur in the subsets. 

1X2. Kg' 

X4n . A system as in Claim Je3^, wherein the best break of 
each subset is determined according to the criteria of the fine 
partitioning method used. 

113 

A system as in Claim wherein the best break of 

each subset is the break occurring closest in time to the time of 
occurrence of the segment boundary in the text data that 
corresponds to that subi;c;t. 

J^^. A system as in Claim wherein the body of 

information is represented by a set of text data, a set of audio 
data and a set of video data, the system further comprising: 

means for ascertaining a synchronization of the audio 
data and the video data; and 

means for determining the location of the segment 
boundaries in the set of audio data using the previously 
determined location of the segment boundaries in the set of 
video data and the synchronization of the audio data and 
video data. 
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A system for identifying the boundaries of segments in 



a body of information, each segment comprising a contiguous 
related set of information in the body of information, wherein 
the body of information is represented by a set of text data, a 
set of video data, and a set of audio data, the system 
comprising : 

means for performing a coarse partitioning method, the 
means for performing a coarse partitioning method further 
comprising : 

means for identifying time-stamped markers in the 

set of text data; and 

means for determining approximate segment 

boundaries within the body of information as the times 

of occurrence of the time-stamp markers; 

means for specifying, for each approximate segment 
boundary, a range of time that includes the time of 
occurrence of the approximate segment boundary; 

means for extracting subsets of audio data from the set 
of audio data that occur during the specified ranges of 
time; 

means for performing a fine partitioning method to 
identify one or more breaks in the set of audio data; 

means for selecting the best break that occurs in each 
subset of audio data, the time of occurrence of the best 
break in each subset being designated as a boundary of a 
segment in the body of information; 
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means for ascertaining a synchronization of the audio 
data and the video data; and 

means for determining the location of the segment 
boundaries in the set of video data using the previously 
determined location of the segment boundaries in the set of 
audio data and the synchronization of the audio data and 
video data. 



3 



system as in Claim wherein the means for 

performing a fine partitioning method further comprises means for 
identifying the best breaks using a process that includes pause 
recognition . 

I i i A system as in Claim , wherein the means for 

performing a fine partitioning method further comprises means for 
identifying the best breaks using a process that includes voice 
recognition . 

vi^ us- 

JJr?'. A system as in Claim J-^, wherein the means for 
performing a fine partitioning method further comprises means for 
identifying the best breaks using a process that includes word 
recognition . 

J-4e*. A system as ii) Claim 4-4^, wherein the means for 
performing a fine partitioning method further comprises means for 
identifying the best breaks using a process that includes music 
recognition . 
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